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Abstract 

Background: Whole genome sequencing of bacteriophages suitable for biocontrol of pathogens in food products 
is a pre-requisite to any phage-based intervention procedure. Trials involving the biosanitization of Salmonella 
Typhimurium in the pig production environment identified one such candidate, OSH19. 

Results: This phage was sequenced and analysis of its 157,785 bp circular dsDNA genome revealed a number of 
interesting features. OSH19 constitutes another member of the recently-proposed Myoviridae ViOl -like family of 
phages, containing 5. Typhi-specific ViOl and Shigella-speafic SboM-AG3. At the nucleotide level OSH19 is highly 
similar to phage ViOl (80-98% pairwise identity over the length of the genome), with the major differences lying in 
the region associated with host-range determination. Analyses of the proteins encoded within this region by 
OSH19 revealed a cluster of three putative tail spikes. Of the three tail spikes, two have protein domains associated 
with the pectate lyase family of proteins (Tsp2) and P22 tail spike family (Tsp3) with the prospect that these enable 
Salmonella 0 antigen degradation. Tail spike proteins of ViOl and SboM-AG3 are predicted to contain conserved 
right-handed parallel p-helical structures but the internal protein domains are varied allowing different host 
specificities. 

Conclusions: The addition or exchange of tail spike protein modules is a major contributor to host range 
determination in the ViOl -like phage family. 

Keywords: Phage biocontrol, biosanitization, bacteriophage genomics, Salmonella Typhimurium, Myoviridae, P22- 
like tail spike, pectate lyase tail spike domain, lipopolysaccharide 
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Background 

The use of virulent bacteriophages (phages) as biological 
control (biocontrol) agents against bacterial pathogens is 
an expanding field of research aimed at producing sus- 
tainable solutions for the control of these pathogens, 
and to circumvent problems such as those associated 
with the development of multidrug-resistant bacteria [1]. 
The antimicrobial activities of phages committed to the 
cellular lysis of a range of bacterial pathogens have been 
reported, which include food pathogens such as Campy- 
lobacter jejuni [2-5], Escherichia coli [6-8], and various 
Salmonella enterica serovars [4,9,10]. However, despite 
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the wealth of data regarding the efficacy of phages dur- 
ing intervention studies, relatively few phage-derived 
products have been developed sufficiently for commer- 
cial application. Only in the last few years has there 
been an extension of lab-based trials into the food pro- 
duction environment, where perhaps the best example is 
the recognition of the efficacy and the granting of 'gen- 
erally recognized as safe' (GRAS) status to bacterio- 
phages targeting Listeria monocytogenes by the United 
States Food and Drug Administration [11]. There are 
two products of note now available commercially-List- 
Shield™ (Intralytix Inc., USA) a phage cocktail compris- 
ing virulent phages with broad activity against L. 
Monocytogenes, and Listex P100™ (EBI Food Safety, 
Netherlands). Following GRAS classification both List- 
Shield™ and Listex P100™ are now viewed as safe to 
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be applied as food biopreservatives on ready-to-eat foods 
in the USA. Phage PI 00 (the active component of Listex 
P100) was initially characterized at the genetic level and 
in oral toxicity studies by Carlton et al [12]. These stu- 
dies showed that P100 had no undesirable genes within 
its genome, and caused no ill effects when administered 
to rats. Many studies reporting the efficacy of Listex 
P100 against L. monocytogenes in various food produc- 
tion settings are now available in the literature [13-15]. 

If more phage-based applications are to reach standards 
where they are deemed fit for human/animal consumption 
then certain matters pertaining to safety must be taken 
into consideration. For instance, phages that show poten- 
tial during preliminary studies must adhere to strict cri- 
teria if they are to be developed further as antimicrobial 
agents [16]. Most importantly, the selected phages must 
not possess genes associated with virulence, or those that 
may enhance the pathogenic profile of its target [17]. 
Many examples of phages that are recognized as being 
involved in such processes are known, for example, the 
Shiga toxin-encoding (Stx) phages-a key virulence factor 
of Shiga-toxigenic E. coli (STEC)-are the causative agents 
of haemolytic uraemic syndrome (HUS), a major contribu- 
tor to disease associated with STEC infection [18,19]. Also, 
temperate phages that have the potential to form lysogens 
with their host need to be eliminated from trials at the 
earliest opportunity. Integration into, and excision out of 
the host genome, can lead to the transfer of genes between 
the phage and bacterium potentially altering the genetic 
profile of both [20]. These undesirable traits are most 
often associated with phages belonging to the Podoviridae 
(icosahedral head with a short non-contractile tail e.g. Sal- 
monella phage P22) or Siphoviridae (icosahedral head 
with a flexible non-contractile tail e.g. phage X) families. 
However, members of the Myoviridae (icosahedral head 
and contractile tail e.g. phage T4) are more often asso- 
ciated with an obligately lytic lifecycle. All of these mor- 
phological characteristics can be easily discerned when 
phages are viewed under a transmission electron micro- 
scope, although assumptions as to the genetic nature of a 
phage cannot be made on morphological characteristics 
alone. Whole genome sequencing of phage isolates 
intended for use as biocontrol agents is now considered to 
be the 'gold standard' in terms of transferring phage-based 
applications from the laboratory to everyday use. 

Here we report the complete genome sequence of 
OSH19, a lytic bacteriophage adapted to infecting a 
number of different S. Typhimurium serovars. OSH19 
has shown great potential as a biocontrol agent against 
S. Typhimurium U288, the most prevalent serovar 
found in UK pig production premises [21]. S. Typhi 
phage ViOl was the first of a recently-proposed new 
lineage of Myoviridae to be described. Genetic analyses 
of the OSH19 genome reveal that it is a close relative of 



phage ViOl, in terms of both DNA and protein 
sequences. However, phage ViOl appears to be restricted 
to infecting S. Typhi, possibly due to the presence of a 
virulence (Vi) capsule antigen-degrading acetyl esterase 
domain incorporated into one of its three tail spikes 
[22]. The other completely sequenced member of the 
ViOl-like phage lineage is SboM-AG3 that is restricted 
to the infection of Shigella spp. [23]. No genes asso- 
ciated with either toxicity or lysogeny have been found 
within the OSH19 157,785 bp circular dsDNA genome, 
nor in any of the ViOl family of phages. A lack of viru- 
lence associated genes in OSH19 will allow its use as a 
biocontrol agent, aimed at reducing S. Typhimurium 
entering the food chain, and in particular S. Typhimur- 
ium U288 from pork production. 

Results 

Bacteriophage OSH19 characterization 

Bacteriophage OSH19 was originally isolated from pig 
intestinal contents and has specific activity against <S. 
Typhimurium serovars [24]. Morphologically the phage 
has an icosahedral head with a contractile tail indicating 
it to be a member of the Myoviridae (Figure 1). A com- 
parison of the lytic profiles of phages OSH19 and ViOl 
against a panel of S. enterica Serovars is given in Table 
1. The data shows that as expected all the S. Typhimur- 
ium strains tested are refractory to infection by ViOl, 
whereas S. Typhi BRD948 likewise displays immunity 
towards infection by OSH19 but susceptibility to ViOl. 

Genome analysis 

The OSH19 genome was sequenced from sonicated 
DNA fragments using the Roche 454 GS FLX platform 




Figure 1 TEM image of OSH19. The presence of an icosahedral 

head and contractile tail indicate that OSH19 is a member of the 

Myoviridae. Magnification x 43,000. Bar 250 nm. 
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(17,796 reads), from which a single contig of 157,785 bp 
was generated. BlastN analysis of the OSH19 genome 
revealed two related genome sequences in the database- 
phages ViOl (Genbank Acc. No. FQ312032) and SboM- 
AG3 (Genbank Acc. No. FJ373894). The OSH19 gen- 
ome was ordered such that it could be directly aligned 
with the existing genomes of ViOl and SboM-AG3 com- 
mencing with the rllA gene. The GC-content of the 
OSH19 genome was determined to be 44.68%, which is 
slightly lower than ViOl (45.22% GC), and considerably 
lower than SboM-AG3 (50.39% GC). Alignment of the 
nucleotide sequences of OSH19 and ViOl indicate that 
they are highly similar over a large proportion of the 
genome (between 80-98% pairwise identity). However, 
for a region located between 120-130 Kb on the genome 
map this homology breaks down to approximately 70% 
identity (and in some instances the conservation is lost 
completely). This region contains a cluster of three 
putative tail spike reading frames in all of the ViOl -like 
phages sequenced to date. At the nucleotide level 
OSH19 is somewhat less similar when compared to 
SboM-AG3, with homologies ranging from 74-100% 
identity over approximately 60% of the genome. Com- 
parison of the varying degrees of homology between 
OSH19, ViOl, and SboM-AG3 are represented in Figure 
2. Otherwise highly similar nucleotide matches were 
restricted to components of Escherichia phage PhaxI 
(Genbank Acc. No. HQ259289), Serratia phage KSP90 
(Genbank Acc. No. AB452990) and Salmonella phage 
Det7 (Genbank Acc. No. AM765843). 

Initial analysis of the OSH19 genome identified 339 
potential open reading frames (ORFs). Following BlastP 
and Pfam domain analyses of each potential ORF, the 
candidate ORFs were reduced to 166. Each ORF was 
then annotated using ViOl and SboM-AG3 phages as 
reference sequences, and categorised as follows: 
Hypothetical phage proteins (81), Conserved hypotheti- 
cal phage proteins (23), DNA replication (24), Tail mor- 
phogenesis (16), Capsid/DNA packaging (7), DNA 
maintenance/circularization (6), Putative uncharacter- 
ized proteins (3), Regulators (2), Lysis (2), and Putative 
homing endonucleases (2). A comparison of the major 

Table 1 Host range assays of OSH19 and ViOl (+ = lysis 
and-= no lysis). 

Salmonella <DSH19 ViOl 

5. Typhimurium U288 + 

S. Typhimurium DT104 WT + 
S. Typhimurium DT104 NCTC 13348 

5. Typhimurium WT (Rawlings) + 

5. Typhimurium WT (Turner) + 

5. Typhimurium LT2 + 

5. Typhi BRD948 + 



proteins of OSH19 with their homologues in ViOl and 
SboM-AG3 is presented in Table 2. Figure 3 shows the 
fully annotated OSH19 genome as a circular genetic 
map. Using tRNAscan-SE, five tRNA genes were also 
identified in a 5 Kb non-coding region of the OSH19 
genome at the following nucleotide positions: Methio- 
nine tRNA (CAT) 137,703-137,735; Asparagine tRNA 
(GTT) 137,806-137,878; Tyrosine tRNA (GTA) 138,497- 
138,577; Serine tRNA (GCT) 138,657-138,742; and an 
undetermined tRNA 138,749-138,834. For phages 
OSH19, ViOl, and SboM-AG3 there exists a set of 
tRNA genes located at similar positions in their respec- 
tive genomes, some of which display a high degree of 
conservation. For example, BlastN analysis of the coding 
sequence for tRNA-Serine that is present in all three 
phages indicates 100% identity at the nucleotide level. 
Two other tRNAs (tRNA- Asparagine and tRNA-Tyro- 
sine) are also shared between the group. For OSH19 
tRNA-Asparagine, the gene is 100% identical with its 
homologue in ViOl and 95% identical to that of SboM- 
AG3, whilst the OSH19 tRNA-Tyrosine shows more 
conservation with that of SboM-AG3 (99% identical) 
than ViOl (84% identical). OSH19 also shares a tRNA- 
Methionine gene (88% identity) and a tRNA of undeter- 
mined specificity (99% identity) with ViOl that are 
absent in SboM-AG3. Phire analysis for phage regula- 
tory elements identified a number of sites in the OSH19 
genome (Table 3). All of the putative regulatory ele- 
ments are found on the non-coding strand (relative to 
the transcribed sequences within that region) however 
they appear to be associated with ORFs on the opposite 
strand. It is quite possible that these elements play a 
role in regulating OSH19 gene expression, in an as yet 
undetermined manner, but perhaps by the production of 
small RNA molecules. 

OSH19 tail spikes 

Three tail spike genes-tspl (bases 120,710-122,641c), 
tsp2 (122,702-124,876c), and tsp3 (124,992-127,088c) 
were identified during the annotation of OSH19. BlastN 
analysis of the tspl gene (1932 bp) shows that the 
nucleotide sequence has 94% identity with that of ViOl 
orf 170c (ViOl Tspl) over the first 263 bases, but 
beyond this conservation breaks down and no further 
nucleotide identities are observed between the two 
genes. A similar sequence to tspl is present at orf 00207 
(SboM-AG3 Tspl) of SboM-AG3 (64% identity over the 
first 390 bases), and a shorter stretch of homology (78% 
identity over the first 54 bases) is located within orf 
00212 (SboM-AG3 Tsp3). As with ViOl, no further 
identity can be observed with tspl following these short 
stretches of conservation. Other nucleotide alignments 
of tspl include identities with the Salmonella phage 
Det7 tail spike gene (86% identity from bases 360-460), 
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Figure 2 Artemis Comparison Tool (ACT) analysis of the genomes of SboM-AG3 (top), OSH19 (middle), and ViOl (bottom). 




and an endo-a-sialidase gene of coliphage K1F (76% 
identical from bases 351-485). Aside from these align- 
ments, which all span a region covering the first 485 
bases of tspl, no homologies were found in the database 
for the region covering bases 485-1932. Alignment of 
the tsp2 gene (2175 bp) reveals that only ViOl and 
SboM-AG3 have similar sequences within the database. 
As with tspl, the homologies all span the start of the 
gene: 85% identity (bases 1-723) with ViOl orf 171c 
(ViOl Tsp2-maturation/adhesion protein) and 70% iden- 
tity (bases 1-360) with SboM-AG3 orf 00210 (SboM- 
AG3 Tsp2). The latter portion of the tsp2 gene does not 
align with any sequences in the database. BlastN 
searches involving the tsp3 gene (2097 bp) show that 
the sequence is highly similar to the Det7 tail spike gene 
(85% identity from bases 1-782). To a lesser extent, tsp3 
shares sequence homology (84% identity between bases 
1-461) with ViOl orf 172c (ViOl Tsp3) and orf 170c 
(69% identity between bases 1-179). Apart from these 
short regions of homology, there are no further identi- 
ties between tsp3 and ViOl. Interestingly, no nucleotide 



sequence homology was found between tsp3 and SboM- 
AG3. Bases 481-667 show a conserved sequence that is 
shared between several phage genomes in the database. 
Sequence alignments over this region indicate 74-77% 
shared identities between various Salmonella phages 
including P22, ST104, ST64T, ST160, SE1, and some of 
the S. Enteritidis typing phages (SETP) [25]. Interest- 
ingly, the sequence is also present in prophage tail 
spikes of S. Typhimurium D23580, S. Typhimurium 
T000240, S. Heidelberg SL476, and S. Paratyphi A 
(strain AKU 12601). As was the case for tspl and tsp2 
no homologous sequences are found to match the latter 
region of the tsp3 gene. 

Translation of the tsp gene sequences indicate that 
Tspl is a 643 amino acid protein (predicted MW ~ 68.9 
kDa), Tsp2 is comprised of 724 amino acids (predicted 
MW ~ 78.2 kDa), and Tsp3 698 amino acids (predicted 
MW ~ 75.8 kDa). Pfam domain searches for each tail 
spike indicated significant domain matches for Tsp2 
(Pectate lyase domain-family 3 CL0268) and Tsp3 (P22 
tail spike family); however no significant domain 
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Table 2 Comparison of OSH19 proteins with those of ViOl and SboM-AG3 



CDS 



OSH19 coordinates Predicted Pfam domain 
& amino acid length MW 



Amino acid 
ID with ViOl 



Amino acid ID 
with SboM-AG3 



RIIA 
RUB 

Putative tail fibre 

DNA topoisomerase II 

DNA topoisomerase II 
(medium subunit) 

Conserved 

uncharacterized protein 
DexA exonuclease 
dCMP deaminase 



Conserved 

uncharacterized protein 

Head completion 
protein 

Putative homing 
endonuclease 

T4-like baseplate tail 
tube cap 

Baseplate wedge 
subunit 

Loader of T4-like 
helicase 

DNA ligase 

DNA primase-helicase 
subunit 

UvsX (RecA-like protein 



1-2742(913) 104,992 None 98% 54% 

2775-4358 (527) 58,980 None 97% 69% 

5502-6311(269) 27,826 Ig-like l-set domain (CL001 1) 86% 86% 

8523-10,436(637) 71,862 HATPase c (CL0025) & DNA gyrase B 96% 90% 

DNA topoisomerase IV 88% 88% 



10,357-11,769 (470) 53,939 

13,309-13,929 (206) 23,005 

14,263-14,895 (210) 24,438 

18,704-19,225 (173) 19,460 



19,227-19,634 (135) 

19,845-20,465c (206) 

20,465-2 1,205c (246) 

21,259-22,227 (322) 

22,240-22,797 (185) 

26,134-26,796c (220) 

27,281 -28,708c (475) 
3 1,451 -32,875c (474) 



dUTPase 

Thymidylate synthase 

DNA end protector 

Baseplate tail tube 

ssDNA binding protein 

Late promoter 
transcription factor 

Regulatory protein 
(FmdB family) 

Putative uncharacterized 
protein 

Baseplate hub subunit 
Tail-associated lysozyme 

Baseplate wedge protein 
NrdB 

NrdA 

PhoH-like phosphate 

starvation-inducible 

protein 

Peptidoglycan binding 
protein 

DNA primase subunit 
Conserved 

uncharacterized protein 



34,789-35,340c (183) 
35,903-36,943c (346) 
39,924-40,268c (234) 
40,488-41,624 (378) 
41,652-42,683c (343) 
43,029-43,331 c (100) 

43,264-43,509c (81) 

43,875-44,396c (203) 

45,471-46,274 (267) 
46,781-48,406 (541) 

48,471-48,851 (126) 
50,229-5 1,332c (367) 

51,403-53,730c (775) 

53,764-54,603c (279) 



15,378 

24,189 

28,205 

36,142 

21,574 

26,253 

53,501 
54,192 



) 33,1 89-34,274c (361) 40,791 



Putative Macro domain of ADP-ribose binding 92% 74% 
module (CL0233) 

None 95% 93% 

dCMP cytosine deaminase 1 family-cytidine & 97% 62% 

deoxycytidylate deaminase Zn-binding domain 

(CL0109) 

Bacterial membrane-flanked domain (DUF304) 93% 88% 

None 92% 74% 

GIY-YIG catalytic domain (CL0418) No match 34% 

T4 tail cap family 100% 94% 

Phage Gp53 family 97% 80% 

T4 helicase N family 99% 79% 



ATP-dependent DNA ligase domain (CL0078) 
DnaB-like helicase N & C terminal domains 



95% 



54.71 1 - 55,505c (264) 28,815 Peptidoglycan binding protein domain (CL0244) < 

protein of unknown function (DUF3380) 

56.71 2- 57,776c (354) 41,386 None 

63,656-64,273c (205) 22,902 T4 RegB endoribonuclease family (CL0037) 



63% 
77% 
91% 
90% 
77% 
70% 



RecA bacterial DNA recombination proteins 
(CL0023) 

21,1 14 dUTPase 2 family(CL0231) 
39,216 Thymidylate synthase family 
28,347 None 
42,658 Phage T4 Gp1 9 family 
38,896 Gp3 DNA binding protein-like domain 
11,175 None 

8732 CXXC CXXC SSSS family containing a Zn ribbon 
domain (CL0167) 

23,739 RuvC family (crossover junction endoribonuclease 
RuvC) 

30,186 T4 baseplate family 96% 67% 

59,023 Gp5 OB family & CHAP domain (CL0125) associated 98% 87% 
with peptidoglycan hydrolysis 

14,124 GPW Gp25 family (gene 25-like lysozyme) 97% 87% 

42,209 Ribonucleotide reductase small chain domain 99% 89% 

(CL0044) 

88,137 ATP-cone domain, ribonucleotide reductase IgN all 94% 89% 
alpha domain & IgC barrel domain 

31,567 PhoH-like protein family (CL0023) 100% 88% 



69% 
93% 
100% 



100% 



100% 
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Table 2 Comparison of <D?0?SH19 proteins with those of ViOl and SboM-AG3 (Continued) 



NrdA.1 

Recombination 
endonuclease subunit 

Recombination protein 
subunit 

a factor for late 
transcription 

Putative homing 
endonuclease 

RNase H 

ATP-dependent helicase 
DNA binding protein 
Conserved 

uncharacterized protein 

Superinfection exclusion 
protein 

ImpD 

Acyl carrier protein 
Conserved 

uncharacterized protein 

Putative RegA 
translational repressor 

Clamp loader for DNA 
polymerase 

Gp44 sliding clamp 
holder 

Gp45 sliding clamp 
holder 

Putative type III 
restriction enzyme (RE) 

Conserved 

uncharacterized protein 
Putative UvsY 

Tail completion protein 
Major capsid protein 

Prohead core scaffold 
protein 

Prohead protease 

Portal vertex protein of 
the head 

Tail tube protein 

GIY-YIG endonuclease 

Tail sheath protein 

Large terminase subunit 

Small terminase subunit 

Proximal tail sheath 
stabilization 

Gp14 neck protein 
Gp13 neck protein 



64,978-65,31 3c (336) 12,934 

65,622-67,955c (777) 88,531 

67,958-69,073c (371) 43,030 

69,060-69,854c (264) 30,775 

69,854-70,576c (240) 27,188 

70,587-71,1 14c (175) 19,878 

7 1,921 -73,522c (533) 60,690 

73,778-74,056c (92) 9728 

74, 140-74,93 1c (263) 28,556 

75,137-75,457c (106) 12,562 

75,736-76,407c (223) 25,665 

80,335-80,687c (110) 12,162 

84,506-84,925c (139) 15,622 

89,926-90,390c (154) 18,078 

90,420-90,842c (140) 16,200 

90,847-91, 836c (329) 37,214 

91,914-92,582c (222) 24,510 

93,293-94,792 (499) 57,802 

94,822-95,568c (248) 28,91 1 

95,568-96,023c (151) 17,991 

96,062-96,559c (165) 18,542 

101,588-1 02,910c 48,059 
(440) 

103,002-1 03,841c 30,898 
(279) 

103,888-1 04,556c 24,482 
(222) 

105,1 02-1 06,784c 63,094 
(560) 

106,852-1 07,385c 19,912 
(177) 

107,41 6-1 07,874c 17,316 
(152) 

107,933-1 09,828c 68,439 
(631) 

109,881 -11 2,091c 84,547 
(736) 

11 2,072-1 12,752c 24,825 
(226) 

11 2,755-1 13,453c 26,979 
(232) 

113,456-1 14,097c 24,759 
(213) 

114,400-1 15,209c 31,142 
(269) 



None 94% 92% 

None 98% 72% 

None 98% 91% 

None 90% 77% 

None No match 36% 

RNase H domain (CL021 9) 97% 71% 

None 99% 86% 

Bacterial DNA-binding protein domain 100% 86% 

SPFH domain/Band 7 family (CL0433) 98% No match 

None 95% No match 

None 95% 74% 

Phosphoantetheine (PP)-binding family (CL0314) 92% 61% 

Protein of unknown function (DUF3268) 90% 84% 

Bacteriophage translational regulator 99% 90% 

None 98% 76% 

ATPase family (CL0023) 99% 87% 

Gp45 sliding clamp C terminal 99% 82% 

Type III RE subunit (CL0023) & helicase conserved C 95% 81% 
terminal domain (CL0023) 

PD-(D/E)XK nuclease superfamily 97% 86% 

UvsY protein family (recombination, repair, & ssDNA 93% 85% 
binding protein 

T4Gp1 9 family 98% 80% 

Gp23 major capsid protein family 98% 94% 

None 96% 74% 

Peptidase U9 family (CL0201) 99% 92% 

T4Gp20 family 99% 81% 

T4Gp1 9 family 100% 98% 

GIY-YIG catalytic domain endonuclease family 95% 32% 
(CL0418) 

Phage sheath 1 family 98% 92% 

Terminase 6 family 95% 85% 

DNA packaging family (terminase DNA packaging 98% 74% 
enzyme) 

None 98% 80% 

T4 neck protein family 100% 82% 

None 99% 87% 
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Table 2 Comparison of <D?0?SH19 proteins with those of ViOl and SboM-AG3 (Continued) 


Tail spike 1 


120,71 0-1 22,641c 
(643) 


68,851 


None 


67% (res. 1- 
171 only) 


62% 
only) 


res. 


1-190 


Tail spike 2 


122,702-1 24,876c 
(724) 


78,194 


Pectate lyase C (CL0268) 


86% (res. 1- 
276 only) 


73% 
only) 


res. 


1-161 


Tail spike 3 


124,992-1 27,088c 
(698) 


75,785 


P22 tail spike family 


86% (res. 1- 
154 only) 


46% 
only) 


res. 


1-179 


Haemolysin-type calcium 
binding protein 


127,1 68-1 30,209c 
(1013) 


108,436 


None 


81% (res. 1- 
418 only) 


63% 
only) 


res. 


1-404 


Baseplate wedge 
subunit 


132,287-1 34,080c 
(597) 


66,068 


None 


98% 


79% 






DNA polymerase 


145,442-148,438 (998) 


116,583 


DNA polymerase family B exonuclease domains 
(CL0194&CL0219) 


98% 


86% 






Putative uncharacterized 
protein 


148,776-149,615 (279) 


32,346 


NT5C family 5' nucleotidase deoxypyrimidine 
(CL0137) 


97% 


80% 







similarities were identified for Tspl. Betawrap analyses 
of the amino acid sequences of each tail spike from 
OSH19, ViOl, and SboM-AG3 shows that each of these 
proteins potentially have the ability to form right- 
handed parallel p -helical structures. A comparison of 
each OSH19 tail spike protein sequence with those of 



ViOl and SboM-AG3, and other homologous proteins in 
the database revealed the N- terminal residues (1-130) of 
OSH19 Tspl are highly conserved with the correspond- 
ing regions in ViOl Tspl (78% ID), SboM-AG3 Tspl 
(75% ID), and to a lesser degree in ViOl Tsp3 (56% ID). 
This N-terminal homology is also shared with the 




I I Hypothetical phage proteins (81) 

I I Tail morphogenesis (16) 

| DNA replication (24) 

I | Conserved uncharacterized proteins (23) 

I CapsidVDNA packaging (7) 

□ Lysis (2) 

| Putative homing endonucleases (2) 

| Putative uncharacterized proteins (3) 

I I Regulators (2) 

| DNA maintenance (6) 

□ tRNA (5) 



Figure 3 A schematic circular map of the OSH19 genome. 
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Table 3 Putative phage regulatory elements identified by 
Phire analysis. 

Putative phage regulatory elements 



From To Sequence 



56,300 


56,319 


TAGCCATGATGTAATCCTC 


79,926 


79,945 


CGGTCATGATGTAmCOT 


99,487 


99,506 


TGGTCATGGTGTAGTOOT 


101,431 


101,450 


GTCCCATGATGTAmCTO 


104,872 


104,891 


mCCATCATGCATCTCOT 


109,822 


109,841 


TOCCATAATATCATOOT 


144,434 


144,453 


TOCCATGATGTAmCOT 



Salmonella phage Det7 tail spike (56% ID), and a tail 
fibre protein from Enter obacter phage Ecpl (42% ID). A 
sequence motif present in OSH19 Tspl G-G-V-G-L-G- 
A-W (beginning at residue 143) possibly signifies the 
boundary at which the catalytic domain of the tail spike 
begins. No such sequence motif is present in ViOl Tspl, 
however similar sequence motifs are present at analo- 
gous positions within SboM-AG3 Tspl (G-G-L-S-S-S- 
N-W) and ViOl Tsp3 (G-G-V-G-T-G-A-W). ClustalW2 
alignments of phage tail-associated proteins containing 
derivatives of these motifs at their N-terminal bound- 
aries are shown in Figure 4A. The C-terminus of 
OSH19 Tspl (residues 300-643) produced no significant 
alignments with any known proteins. BlastP analyses of 
ViOl Tspl and SboM-AG3 Tspl was performed to iden- 
tify any functionally-related proteins. However, as with 
OSH19 Tspl, ViOl Tspl produced no significant align- 
ments over the C-terminal region with any protein 
sequences in the database. 

Alignment of OSH19 Tsp2 with ViOl Tsp2 (matura- 
tion/adhesion protein) and SboM-AG3 Tsp2 shows a 
high degree of conservation from residues 1-160. At this 
point SboM-AG3 sequence homology (73% ID over resi- 
dues 1-162) breaks down with no further identity to 
either ViOl or OSH19 Tsp2. Residues 100-403 in OSH19 
Tsp2 show a weak but significant relationship with a 
phage structural protein/putative tail fibre (orf 00213) 
present in SboM-AG3. For OSH19 Tsp2 and ViOl Tsp2, 
homology continues over the first 276 residues (86% ID) 
at which point the two sequences completely diverge 
from each other. Another region of low homology (resi- 
dues 100-316) is also found with a putative tail fibre pro- 
tein encoded in the ViOl genome (ViOl orf 173c). A 
pectate lyase protein family domain (family 3-CL0268) 
spanning residues 291-549 was identified during Pfam 
analysis of OSH19 Tsp2. The presence of this motif was 
further confirmed during BlastP analysis, with a number 
of alignments being made with various glycoside degrad- 
ing enzymes from a wide range of microorganisms. No 
significant alignments with the C-terminus of OSH19 
Tsp2 were identified (residues 557-724). 



OSH19 Tsp3 alignment with ViOl Tsp3 shows a high 
degree of sequence conservation between their N-term- 
inal residues 1-154 (86% ID). However, the correspond- 
ing sequence in SboM-AG3 Tsp3 shows similarity over 
the first 19 residues before a deletion of 63 amino acids, 
after which the homology is restored between the three 
tail spike sequences. Interestingly, SboM-AG3 Tspl 
shows a higher degree of conservation with OSH19 
Tsp3 over residues 1-139 (46% ID) than does SboM- 
AG3 Tsp3. The boundary motif G-G-V-G-L-G-A-W 
appears in OSH19 Tsp3 at residues 143-150 with analo- 
gous sequences in ViOl (G-G-V-G-T-G-A-W) and 
SboM-AG3 (G-G-V-S-S-S-A-W), after which the 
sequence homologies between all three tail spikes break 
down. Pfam domain analysis of O SHI 9 Tsp3 identified a 
protein domain (residues 159-698) with significant 
homology to the P22 tail spike family (PF09251). BlastP 
analysis produced alignments with Det7 (the first 
reported Myovirus with a podoviral tail spike), P22, and 
a number of P22-like phages. The highest degree of 
homology was found for the tail spike of Det7 (77% ID 
over the first 300 residues). The sequence motif marking 
the domain boundary in OSH19 Tsp3 (G-G-V-G-L-G- 
A-W) is conserved in Det7. P22 tail spike-like sequences 
feature three conserved residues that function as cataly- 
tic components of the endo-rhamnosidase activity asso- 
ciated with P22 tail spike protein [26,27]. These residues 
are located in the substrate binding cleft of the P22 tail 
spike and are conserved in OSH19 Tsp3 (Figure 4B: 
Glu-359, Asp-392 and Asp-395). 

Discussion 

Bacteriophage OSH19, a candidate for use as a biocon- 
trol agent against Salmonella Typhimurium, was sub- 
jected to whole genome sequencing. This process is now 
viewed as a pre- requisite to using phages as therapeutic 
agents, especially if they are to be introduced into a 
food production environment. The presence or absence 
of genes associated with toxicity or lysogeny will ulti- 
mately govern whether or not a potential phage candi- 
date is safe for commercial use. OSH19 was found to 
contain no undesirable genes in its 157,785 bp genome 
making it an excellent candidate for such applications. 
Also, OSH19 constitutes a new addition to the ViOl -like 
phage family (along with S. Typhi phage ViOl and Shi- 
gella phage SboM-AG3). In many respects OSH19 is 
quite similar to ViOl with the addition of a putative 
homing endonuclease and a putative uncharacterized 
protein. Structural genes and those associated with mor- 
phology of the phage particle are highly conserved 
between OSH19 and ViOl, and to a lesser degree with 
SboM-AG3. Analysis of the tRNA genes encoded by all 
three phages, and their location within each genome 
indicates that some of them are possibly derived from a 
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4A 



Det7 Tsp 
SETP12 Tsp 
Phil9 Tsp3 

klH Tsp63D sialidase 

K1G Tsp63D sialidase 

K1F tail fibre 

ViOl Tsp3 

Phil9 Tspl 

SboM-AG3 orf00207 

EcoDSl gpl7 

EPECal4 tail fibre 

SboM-AG3 orf 00210 

K. pneumoniae prophage tail 

Vill-El maturation/adhesion 



PKWPAGSTPASTGGVGLGAWVSVGDAAFRQEAN KKFKYSVKLSDYS 17 6 

PKVIPAASTPLTAGGISDSAWVAFGDITFRAEAD KKFKYSVKLSDFT 153 

PKTVDAGSTPETSGGVGLGAWLSVGDAAFRQEAN KKFKYSVKLSDYS 17 6 

PKTIPASSSPQSTGGVSEGAWQRAGSVVLRQQISNANGAILYPDLHVARWKDEYDPRAWG 166 
PKIIPASSSPQSTGGVADGAWQPVGSVILRQQISDVNGAILYPDLHVARWKDEYDPRAWG 166 
PKNVPAGSTPETSGGIGLGAWVSVGDAALRSQISNPEGAILYPELHRARWLDEKDARGWG 24 5 
PKVVAAGSTPDNSGGVGTGAWIPVGINSLSGDGGWSRIGSEFGKSLKEHIQSLPSLSAVK 18 9 

PKTVAPGSTPATTGGVGLGAWVSVGDVSLRSELS KEDGFKYIGYS 172 

PKSVNAGGTIANTGGLSSSNWVKLDPDALRDELY SGFYRLT 169 

PKEVPAGSTPSTTGGVGLGAWVSVGDASLRSNLIS DNSINGDALI 2 30 

PKDVPENSTPESTGGVSLGAWVSVGDASLRSDLIS QETDKGS 2 93 

PKTVNKNSTPQSTGGVSSSAWVSVESKTLGSELLTG EKASLVG 111 

PKTVPIASSPATTGGVRAGGWAYTSDAAIRQNLAADSGAKLVGGLNYVTPEMSGFVTGAG 152 
PKTVPPGSTPYSSGGTGNGKWMAVDVADVLRKDLAKPTGANLIGIHDTTVGEVLRQKVYI 17 3 



4B 



ST64T Tsp g|sgvktwqgtvgsttsrnynlqfrdswiypvw|gf|lgad 400 

ST104 Tsp GESGVKTWQGTVGSTTSRNYNLQFRDSVVIYPVWDGFDLGAD 400 

P2 2 Tsp GESGVKTWQGTVGSTTSRNYNLQFRDSWIYPVWDGFDLGAD 400 

SETP1 3 Tsp GESGVKTWQGTVGSTTSRNYNLQFRDSAVLYPVWDGFDLGAD 421 

SETP12 Tsp GESGVKTWQGTVGSTTSRNYNLQFRDSAVLYPVWDGFDLGAD 421 

SP6 Tsp GESGFKTWQGSVGAGTSRNYNLQFRDSVALSPVWDGFDLGSD 287 

Det7 Tsp GESGFKTYQGSVGGGTARNYNLQFRDSVALSPVWDGFDLGSD 44 4 

19 Tsp3 g|sgtktyqneiggrsarnyrlvfdnittiqcyy|gi|vnad 440 



4C 



ST64T Tsp KLSKTLDSGALYSHINVGPGSGSAWTQLTAISGNTPDAVS 60 6 

ST104 Tsp KLSKTLDSGALYSHINGGPGSGSAWTQLTAISGNTPDAVS 606 

P22 Tsp KLSKTLDSGALYSHINVGPGSGSAWTQLTAISGNTPDAVS 60 6 

SETP13 Tsp KLSKTLDSGSVYSHINGGPGSGSAWTEITAISGSSPDAVS 627 

SETP12 Tsp KLSKTLDSGSVYSRINGGPGSGSAWTEITAIAGSLPDAVS 627 

SP6 Tsp KLTKTLDSGAIRSTLNGGPGTGSAWTEMTAISGSAPNAVS 4 93 

Det7 Tsp KLSATLDSGALRSHLN GYAGSGSAWTELTA LSGST PNAVS 650 

oshi9 Tsp3 mggttnpsllmtadcq|atpggeasivkl|aiqeg^^H 641 



ST64T Tsp 
ST104 Tsp 
P22 Tsp 
SETP1 3 Tsp 
SETP12 Tsp 
SP6 Tsp 
Det7 Tsp 
OSH19 Tsp3 



lkvnhkdcrgaei 
lkvnhkdcrgaei 
lkvnhkdcrgaei 
lkinrgdyravei 
lkinrgdyravei 
lkinrgdfkatei 
lkvnrgdykttei 



pfvpdiasddfikdsscflpywennstslkalvkkpngelvr 6 61 
pfvpdiasddfikdsscflpywennstslkalvkkpngelvr 661 
pfvpdiasddfikdsscflpywennstslkalvkkpngelvr 661 
pvamsalpdnavrdigsismylegd--slkalvrradgsytr 68 0 
pvamtvlpdnavrdngsislylegd--slkalvkradgsytr 68 0 
pvaptvlpdeavrdhssialyfdqe--alwalvkkpngsltr 54 6 
pisgtvlp degvldintmslyldag--a lwalirlpdgs ktr 7 03 
IlpKalehnssigfffgsdg-alrllakkpdgs^B 6 95 



ST64T Tsp LTLATL 667 

ST104 Tsp LTLATL 667 

P22 Tsp LTLATL 667 

SETP1 3 Tsp LTLA 684 

SETP12 Tsp LTLA 684 

SP6 Tsp MKLA 550 

Det7 Tsp MKLSV 708 

OSH19 Tsp3 |TL 698 

Figure 4 A-C ClustalW2 alignments of sequence motifs in tail-associated proteins (4A), the active site of P22-like tail spikes (4B), and 
the C-terminal region of P22-like tail spikes-yellow-sequence motif (residues 585-596 in P22) associated with intertwined region, 
green-five-stranded p-sheet and three-stranded (3-sheet (residues 601-664 in P22), and red-regions lacking homology in OSH19 Tsp3 

(4C). 
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common ancestor. Transmission electron microscope 
images of OSH19 (Figure 1), ViOl and SboM-AG3 indi- 
cate that their morphologies are quite similar. The intri- 
cate 'chandelier-like' arrangements of tail spikes 
attached to the baseplate are visible for all three phages 
[22,23]. It seems likely that there are more ViOl-like 
phages (such as Escherichia phage PhaxI) capable of 
infecting a diverse range of bacterial pathogens to be 
discovered for which there are commercial applications. 

The modular structure of the three tail spikes found 
in OSH19 is potentially the main driving factor behind 
host-range determination for this phage. Although no 
function can at present be assigned to Tspl, the other 
two tail spikes contain defined protein domains that 
provide evidence as to their mode of action. For Tsp2, 
the pectate lyase domain indicates that this protein may 
well function to modify or cleave glycoside bonds. The 
pectate lyase family domain (CL0268) notably contains 
proteins with pectate/pectin lyase and pectin methyles- 
terase activities. The pectate/pectin lyases are a well- 
characterized family of proteins principally involved in 
microbial plant pathogenesis. Their primary mode of 
action involves the eliminative cleavage of a- 1,4 linked 
galacturonosyl residues of pectins that are components 
of the middle lamella of plant cell walls. The three 
dimensional structure of Erwinia chrysanthemi EC16 
pectate lyase C (PelC) was first solved by Yoder et al. 
[28] and is representative of a family of proteins con- 
taining right-handed parallel P -helical structures with 
the flexibility to allow protein loops from the stacked 
coils to add functionality, for example the formation of 
the active site clefts that support enzyme catalysis 
[29,30]. Using a similar base architecture pectin methy- 
lesterase acts to de-esterify pectin to pectate. Members 
of the pectinolytic protein family are represented in pro- 
karyotic and eukaryotic microorganisms, and also func- 
tion in plants to remodel cell walls [29]. Divergence 
from the ancestral sequence over time has allowed dif- 
ferent microorganisms to target a range of pectin-like 
substrates, while the overall structure has been main- 
tained [30]. It is of little surprise that phages have 
evolved the ability to acquire these proteins and utilize 
their polysaccharide modifying properties. In the case of 
OSH19 Tsp2 it could be postulated that the target is the 
a-D-galactose (1-4) a-D-mannose linkage found in the 
S. Typhimurium O-antigen trisaccharide repeat. Many 
phage tail-associated proteins have been found to con- 
tain right-handed parallel P -helical structural domains, 
for example phages P22 [31], Sf6 [32], HK620 [33] and 
Det7 [27]. Moreover the presence of a pectate lyase-type 
structure in a phage tail spike protein has previously 
been reported for K5 lyase (KflA) of coliphages K5A 
and Kl-5. KflA targets and degrades the capsular poly- 
saccharide of E. coli K5, thus allowing the phage access 



to outer membrane receptors [34]. Whilst the precise 
identity of the polysaccharide target of OSH19 Tsp2 is 
at present unknown, it is possible that this tail spike 
plays a major role in host-range determination. Plaque 
assays using phages ViOl and OSH19 against a panel of 
S. Typhimurium and S. Typhi BRD948 indicate that 
these phages are specific for their host (Table 1). This is 
perhaps not surprising since ViOl orf 171c (ViOl Tsp2) 
possesses a 9-0-acetyl esterase/acetylxylan esterase 
domain (residues 343-445; DUF303) that targets the 
acetyl-modification of the S. Typhi Vi capsule [22]. The 
pectate lyase domain found in OSH19 Tsp2 is more 
than likely involved in the degradation of a polysacchar- 
ide present on the surface of S. Typhimurium U288. 
Anany et al. [23] also report that SboM-AG3 is Shigella- 
specific and is unable to form plaques on lawns of any 
of the Salmonella, Escherichia, and Listeria strains 
tested. 

A protein domain composed of parallel right-handed 
P-helices is also found in OSH19 Tsp3-the P22 tail 
spike domain. Many tail spike proteins possess an N- 
terminal binding domain involved in attaching the pro- 
tein to the virion head/tail structure (the sequence of 
which is often highly conserved between phages with 
similar morphologies), a central catalytic domain (con- 
taining the trimeric parallel right-handed P-helices), and 
a C-terminal trimerization domain (involved in stabiliza- 
tion of the trimeric tail spike). The trimeric P-helices 
that form the catalytic domain bind and cleave polysac- 
charides present in bacterial lipopolysaccharide (LPS). 
P22 tail spike (Gp9) utilizes this endo-rhamnosidase 
activity to degrade the Salmonella O antigen [35]. Clea- 
vage of a-L-rhamnose (1-3) a-D-galactose found in the 
trisaccharide-repeats of S. Typhimurium LPS takes place 
on the external surface of the tail spike. The structure 
and arrangement of the parallel p -helix tail spike 
domains result in a solvent-exposed exterior that fea- 
tures the proposed catalytic residues Glu-359, Asp-392, 
and Asp-395 [27,36]. The active site residues and their 
sequence environments are conserved in OSH19 Tsp3, 
and therefore can be postulated to operate in a manner 
akin to the P22 tail spike and its relatives. Thus, there is 
a possibility that OSH19 Tsp3 acts in two distinct ways. 
Firstly, OSH19 Tsp3 may function as an initial step 
towards irreversible adsorption of the phage particle to 
its host, hydrolyzing the outer S. Typhimurium LPS 
layer in order to provide access to a previously inacces- 
sible outer membrane receptor. Secondly, the endo- 
rhamnosidase activity associated with P22-like tail spikes 
may aid phage progeny in freeing themselves from cellu- 
lar debris encountered during host lysis [37]. 

Tsp3 of OSH19 has an N-terminal sequence similar to 
ViOl orf 172c (ViOl Tsp3), SboM-AG3 orf 00207 
(SboM-AG3 Tspl) and Det7 tail spike. These phages are 
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all Myoviridae of similar morphologies. Therefore the 
requirements of having to attach the tail spike to similar 
base plate structures more than likely places constraints 
on sequence divergence. Likewise, the alignments for 
the Podoviridae tail spikes show N-terminal conserva- 
tion between related phage, followed by conserved cata- 
lytic and C-terminal domains that are shared between 
Myoviridae, Podoviridae and Siphoviridae. Interestingly, 
P22, ST64T, and ST104 as members of the Podoviridae 
are 98-99% identical over their 667 amino acid tail spike 
proteins, and similarly Siphoviridae members SETP12 
and SETP13 also have almost identical tail spike 
sequences. Despite that the SETP 12 and 13 phage 
sequences are markedly different from P22, ST64T, and 
ST104 over the N-terminal regions; however, they 
become conserved between residues 140 to 684. As 
noted above, there appears to be sequence divergence 
constraints placed on conserved N-terminal regions 
associated with the myoviruses OSH19, ViOl, SboM- 
AG3 and Det7. Similarly it appears there are structural 
restrictions imposed on the Podoviridae and Siphoviri- 
dae N-terminal tail spike sequences. Similarities between 
OSH19 Tsp3, Det7 and the P22-like sequences begin ten 
amino acids after the boundary motif G-G-V-G-L-G-A- 
W that appears in OSH19 Tsp3 and Det7 (this motif 
also signals the divergence of the tail spike sequences of 
phages ViOl and SboM-AG3 from OSH19 and Det7). 
OSH19 Tsp3 and Det7 Tsp show striking conservation 
with the P22-like sequences over the catalytic domain, 
up to and including the active site residues Glu-359, 
Asp-392 and Asp-395. However, immediately after the 
active site in OSH19 Tsp3 (residues 440-698) sequence 
conservation breaks down. 

For P22-like tail spike proteins, the C-terminal residues 
585-596 form an intertwined region that allows extensive 
hydrogen bonding between subunits, whilst residues 606- 
664 form a five-stranded and a three-stranded P -sheet 
region [27]. ClustalW2 alignment of these regions show 
that all the aligned P22-like tail spikes contain conserved 
residues at these locations (including the Myovirus Det7), 
that set them apart from OSH19 Tsp3 (Figure 4C). The 
eight amino acids spanning residues 585-592 (V/G-G-P/ 
A-G-S/T-G-S-A-W) retain a sequence motif similar to 
that found at the N-terminal boundary of OSH19 Tsp3, 
and the tail-associated proteins aligned in Figure 4A. 
However, this motif is absent at the C-terminal end of 
OSH19 Tsp3. P22-like tail spike modules may well have 
descended from a common ancestor with the flexibility 
to diverge their protein sequences whilst retaining the 
topology of the right-handed parallel p -helices. It has 
been proposed that this flexibility has allowed the protein 
domain to become widespread amongst bacteriophages 
[33], with the possibility of later domain interchange 
creating further diversity [38]. 



In summary, we have sequenced the genome of 
OSH19, a promising candidate for phage biosanitization 
of S. Typhimurium in the food production environment. 
The 157,785 bp circular dsDNA genome was found to 
contain no genes associated with toxicity or lysogeny, 
which is a pre-requisite for such applications. Analysis 
of the nucleotide sequence of OSH19 revealed only two 
close relatives in the database: S. Typhi-specific ViOl 
and Shigella-specific SboM-AG3 that collectively form 
the ViOl -like phage family. Major differences were evi- 
dent between OSH19 and ViOl in three different tail 
spike proteins. Two tail spikes from OSH19 contain 
protein domains associated with the degradation of 
polysaccharides common to Salmonella LPS, namely the 
pectate lyase (Tsp2) and P22 tail spike-like (Tsp3) 
families of proteins. The acquisition of these domains is 
the most obvious reason to explain the different host 
specificities of OSH19, ViOl (seemingly host-restricted 
due to the presence of a Vi antigen-degrading tail spike 
protein domain) and SboM-AG3 (whose tail spikes are 
as yet undefined). Based on amino acid sequence analy- 
sis the tail spikes from all three phages appear to form 
right-handed parallel P -helical structures. This appears 
to be an evolutionarily-conserved structure for all three 
tail spikes found in the ViOl -like phage. However, this 
conservation is coupled to the exchange of individual 
protein domains within these structures that may ulti- 
mately determine host range. 

Materials and methods 

Isolation of OSH19 

For phage isolation, sewage effluent was filtered through 
0.2 |im Minisart filters (Sartorius Biotech. Cat. No. 
16534) and the filtrate collected in sterile universals and 
stored at 4°C until required. For S. Typhimurium lawn 
preparation NZCYM broth (Difco Cat. No. 240410) cul- 
tures were prepared and incubated overnight at 37°C 
with shaking. Following this, the overnight culture was 
used to seed fresh NZCYM broth containing 10 mM 
MgS0 4 (Sigma Aldrich Chemicals Cat. No. M2643) 
which was then incubated for 2 hours at 37°C with 
shaking. To molten (tempered to ~ 50°C) NZCYM top 
agar containing 0.6% Bacteriological Agar No.l (Oxoid 
Cat No. LP0011) 500 [iL of the required Salmonella was 
added, followed by 500 [iL filtrate, and the mixture was 
poured onto NZCYM agar plates. The plates were left 
to set on the bench for 20 minutes before being inverted 
then incubated overnight at 37°C. Any plaques identified 
were picked using sterile pipette tips and resuspended in 
500 [iL SM buffer (50 mM Tris-HCl [pH7.5], 100 mM 
NaCl, 8 mM MgS0 4 :7H 2 0, 0.01% gelatin, pH 7.5), incu- 
bated at 37°C for 1 hour, then serial diluted in SM buf- 
fer. A 25 |iL volume of each dilution was then added to 
Salmonella /NZCYM top agar, and lawns were prepared 
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as described above. This process was repeated three 
times in order to obtain a single clonal isolate of 
OSH19. 

Phage host range assays 

The ability of phages OSH19 and ViOl to lyse various S. 
Typhimurium serovars and S. Typhi BRD948 (kindly 
provided by D. Pickard Wellcome Trust Sanger Centre, 
UK) was determined as follows. S. Typhimurium and S. 
Typhi BRD948 top agar lawns were prepared as 
described above, with the exception of the latter being 
grown on supplemented minimal media as described by 
Tacket et al. [39]. To each bacterial lawn triplicate 20 
uL volumes of log 10 7 PFU ml/ 1 dilutions of the required 
phage were applied. Following a sufficient drying period, 
plates were inverted and incubated overnight at 37°C. 
The following day, each plate was observed for lysis and 
the results are shown in Table 1. 

Transmission electron microscopy 

A freshly-prepared high titre phage suspension of 
OSH19 was sedimented at 34,900 x g for 2 hours (4°C). 
Following centrifugation, the supernatant was decanted 
and each phage pellet was washed twice with 0.1 M 
ammonium acetate for 1 hour at 25,000 x g. The wash 
solution was discarded and 2 mL SM buffer added to 
each centrifuge tube. Phage pellets were recovered fol- 
lowing overnight incubation at 4°C with gentle shaking. 
A small drop of washed phage suspension was spotted 
onto a carbon-coated copper mesh grid and allowed to 
sit for 3 minutes. Excess phage suspension was then 
removed with filter paper. For negative staining one 
drop of phosphotungstic acid [pH 7.4] was added to 
each grid, and excess stain was removed one minute 
later with filter paper. Each grid was then covered and 
allowed to dry for 15 minutes. Images were taken with a 
Fei Tecnai Biotwin TEM (Fei Company, USA). 

OSH19 DNA extraction and sequencing 

A high titre suspension of OSH19 (~ 10 10 PFU mL" 1 ) 
was subjected to a single-step caesium chloride (CsCl) 
purification procedure as follows. CsCl (Melfords Cat. 
No. MB 1006) was added to the high titre phage suspen- 
sion to a final concentration of 0.75 g mL" 1 . Samples 
were then subjected to ultracentrifugation at 264,000 x 
g for 24 hours in a Beckman TL100 ultracentrifuge (15° 
C). Extraction of the band containing purified highly- 
concentrated OSH19 was then performed with a sterile 
20-gauge hypodermic needle, and the band was subse- 
quently collected in a sterile tube. Residual CsCl was 
removed from the sample using an Amicon® Ultra-0.5 
30 kDa MWCO centrifugal filter device (Millipore Cat. 
No. UFC 503008) as follows. Briefly, phage-CsCl solu- 
tion was added to the column, which was then spun at 



17,900 x g in a benchtop centrifuge for 30 mins. The 
column was then washed twice with SM buffer at 
17,900 x g for 2 minutes. To elute the retained phage, 
SM buffer was added to the column which was then 
inverted and placed in a fresh collection tube. The col- 
umn was then centrifuged at 17,900 x g for 10 minutes 
to recover the phage. OSH19 DNA was isolated from 
the purified stock using the phenol-chloroform extrac- 
tion method with slight modifications as follows. An 
equal volume of 10 mg mL" 1 Proteinase K (Fisher Scien- 
tific Cat No. BPE 1700-500) was added to the CsCl-pur- 
ified sample, followed by detergent solution (10 ng mL" 1 
Proteinase K in 50 mM EDTA, 50 mM Tris-HCl [pH 
8], 1% N-lauroyl sarcosine (Sigma Aldrich Chemicals, 
Cat No. L9150)). The solution was then incubated over- 
night at 55°C. Following this, phage DNA was extracted 
using phenol: chloroform and ethanol precipitation [5]. 
The extracted OSH19 DNA was purified using a DNA 
wizard purification kit (Promega, Cat No. A1120) then 
subjected to whole genome amplification using a Repli- 
G kit (Qiagen, UK Cat. No. 150023). Genomic DNA 
was fragmented to 500 bp using a Covaris S2 sonicator 
(Covaris Inc., USA) and libraries constructed using a 
NEBNext DNA Sample Prep Master Mix Set 2 (New 
England Biolabs Cat. No. E6070S). The libraries were 
subsequently sequenced using the Roche 454 GS FLX 
system (Roche Diagnostics, USA). 

The fully sequenced OSH19 genome was annotated 
using a combination of CLC Genomics Workbench (CLC 
Bio, Denmark) and Artemis software [40]. For compari- 
sons between the genomes of OSH19, ViOl, and SboM- 
AG3, ACT software was used [41], and comparison files 
were generated using the web-based programme Double- 
ACT http://www.hpa-bioinfotools.org.uk/pise/double_- 
act.html. Nucleotide and protein searches were 
performed using the Blast search algorithm [42]. For the 
identification of protein domains of known function, 
searches of the Pfam database were made [43], whilst 
protein alignments were generated using ClustalW2 [44] . 
Betawrap was used to identify potential right-handed par- 
allel p -helices in the tail spike proteins encoded by 
OSH19, ViOl, and SboM-AG3 [45]. For the identification 
of phage regulatory elements Phire analysis of the 
OSH19 genome was performed [46], and tRNAscan-SE 
1.21 was used to identify tRNA genes [47]. The complete 
annotated OSH19 genome sequence has been deposited 
in the NCBI database (Genbank Acc. No. JN126049). 
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